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Abstract: In quantum theory, the curved spacetime of Einstein's general theory of rel- 
ativity acts as a dispersive optical medium for the propagation of light. Gravitational 
rainbows and birefringence replace the classical picture of light rays mapping out the null 
geodesies of curved spacetime. Even more remarkably, superluminal propagation becomes 
a real possibility, raising the question of whether it is possible to send signals into the past. 
In this article, we review recent developments in the quantum theory of light propagation 
in general relativity and discuss whether superluminal light is compatible with causality. 
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1. Time and Light 

Einstein's discovery of relativity nearly a century ago highlighted the fundamental nature of 
the speed of light and revolutionised our concept of time. In particular, relativity predicts 
that a signal moving faster than light would, from the viewpoint of some inertial observers, 
be moving backwards in time. The ideas of faster than light motion and time travel 
became inextricably linked. Recently, a new perspective on these issues has arisen from the 
study of quantum effects such as vacuum polarisation on the nature of light propagation 
in general relativity. The classical picture of light rays mapping out the geometry of 
curved spacetime gives way in quantum theory to a range of optical phenomena such as 
birefringence, lensing and dispersion. Gravitational rainbows appear. Most remarkably, 
even superluminal propagation appears to be possible. 

In this paper, we review some of these recent developments, focusing on the issue of 
superluminal light and its implications for causality. From the outset, we have to clarify 
what we mean by 'superluminal'. Special (general) relativity is characterised by global 
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(local) Lorentz invariance, that is the invariancc of the laws of physics in distinct inertial 
frames of reference. The Lorentz transformations introduce a new fundamental constant 
of nature, c, which in classical theory may be identified with the unique and unambigu- 
ous speed of light. In what follows, however, we must distinguish clearly between this 
fundamental constant and the actual propagation speed of light; when we use the word 
'superluminal', we simply mean 'greater than c'. The key point is that it is the null cones 
generated by the speed c which divide spacetimc into 'timelike' and 'spacclike' regions - 
a superluminal signal therefore travels into the spacelike region and, for some observers, 
back in time. As we shall see, light may do precisely this. 

The propagation of light is already a rich field in classical general relativity. In the 
presence of a gravitational field, spacetime becomes curved. Light rays follow null geodesies 
- geodesies being the analogues of shortest distant paths in curved spacetime, and null 
denoting that the frequency and wave vector satisfy the simple relation /c^ = c<j^ — | fcp = 
(where from now on, we use conventional units where c = 1) which is equivalent to the phase 
velocity Uph = io/\k\ = 1. Light rays therefore act as tracers, mapping out the curvature 
of spacetime. This immediately predicts the bending of light around a massive object, 
as observed by Eddington during the famous 1919 expedition to observe a solar eclipse in 
Brazil. Nowadays, this effect is the basis for the burgeoning science of gravitational lensing, 
which has become a crucial tool in the search for dark matter in the universe. 

In quantum field theory, however, a whole range of new phenomena emerge. Curved 
spacetime acts as an optical medium with its own refractive index. Dispersive effects 
(rainbows) and polarisation-dependent propagation (birefringence) arise. The simple iden- 
tification of the geometric null cones of curved spacetime with the physical light cones 
corresponding to the actual propagation of light pulses is lost. The field of quantum 
gravitational optics takes on a new richness. The big surprise, however, is the apparent 
prediction in some circumstances of a superluminal speed of light. 

These new developments were initiated by a pioneering paper in 1980 by Drummond 
and Hathrell [1], who studied the effects on light propagation of vacuum polarisation (see 
Fig. 1) in quantum electrodynamics. This is a uniquely quantum field theoretic process, in 
which a photon metamorphoses into a virtual electron-positron pair. This gives the photon 
an effective size characterised by the Compton wavelength Ac of the electron. It follows 
that its propagation will be affected by gravity if the scale of the spacetime curvature is 
comparable to Ac. Of course, such effects will be tiny for the curvatures associated with 
normal astrophysical objects. However, this is characteristic of the physics of quantum 
fields in curved spacetime, the most important example of which is the famous Hawk- 
ing [2] radiation from black holes. Like Hawking radiation, the main importance of the 
effects described here is likely to be conceptual, especially on their implications for our 
understanding of the realisation of causality in theories incorporating quantum theory and 
gravity. 

As Drummond and Hathrell showed, the effect of vacuum polarisation is to induce 

direct interactions of 0(a) between the electromagnetic field and the spacetime curvature. 
In other words, the classical Maxwell equations are modified by terms explicitly involving 
the curvature. This breaks the strong equivalence principle (SEP). As we explain care- 
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fully later, it is this effective violation of the SEP which opens up the possibility that 
superluminal propagation may exist without, as would be the case in special relativity, 
necessarily implying a breakdown of causality. This is the key conceptual issue raised by 
the introduction of quantum field theory into gravitational optics. 

The paper is arranged as follows. In section 2, we review briefly the key points of 
geometric optics, which allows us to pass from Maxwell's equations, or their quantum 
modifications, to the characteristics of light propagation. Vacuum polarisation in curved 
spacetime and the construction of the Drummond-Hathrell effective action is reviewed in 
section 3, and its implications for photon propagation in a variety of cosmological and black 
hole spacetimes are explored in section 4. Section 5 is devoted to the conceptual issues 
surrounding the compatibility of superluminal light with causality. Section 6 focuses on 
the crucial question of dispersion, explaining why the high-frequency limit is essential for 
causality and describing recent developments [3, 4, 5] in the quantum theory, including a 
generalised effective action aimed at clarifying the nature of high-frequency propagation. 
Finally, in section 7, we comment briefly on a number of speculative ideas in the current 
literature which reflect in some way on the physics reviewed here. 



2. Geometric Optics in Curved Spacetime 

Classically, the propagation of light in general relativity is governed by Maxwell's equation 
for the electromagnetic field, 

D^F^"" = (2.1) 

together with the Bianchi identity 

^[Ai^H = (2-2) 

which ensures F^y = d^A^y — dpA^. The simplest way to deduce the properties of light cones 
and light rays from these equations is to use geometric optics. (For a clear introduction, 
see for example ref.[6].) This starts from the ansatz 

(A^ + zeB^ + ...)exp(i-) (2.3) 

in which the electromagnetic field is written as a slowly-varying amplitude and a rapidly- 
varying phase. The parameter e is introduced as a device to keep track of the relative order 
of magnitude of terms, and the Bianchi and Maxwell equations are solved order-by-order 
in e. The wave vector is identified as the gradient of the phase, = d^-d. We also write 
Af^ = Attf^, where A represents the amplitude itself while specifies the polarisation, 
which satisfies k^a^ = 0. 

Solving the Maxwell equation, we find at 0(l/e), 

= (2.4) 

while at 0(1), 

k^'Df.a" = (2.5) 
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and 

k^'D^{lnA) = --D^k^' (2.6) 

Eq.(2.4) shows immediately that k^ is a null vector. Prom its definition as a gradient, 
we also see 

k^'Df.k'' = k^'D''kf, = -D^k"^ = (2.7) 

Light rays, or equivalently photon trajectories, are the integral curves of /c*^, i.e. the curves 
x'^{s) where dx^ [ds = k'^. These curves therefore satisfy 

j2 1/ J fi J A 

o = ,.i,,r = ^i + rj,^^^ (2.8) 

This is the geodesic equation. We conclude that for the usual Maxwell theory in general 
relativity, light follows null geodesies. Eqs.(2.7),(2.5) show that both the wave vector and 
the polarisation are parallel transported along these null geodesic rays, while Eq.(2.6), 
whose r.h.s. is just (minus) the optical scalar 9, shows how the amplitude changes as the 
beam of rays focuses or diverges. 

This picture allows us to identify the physical light cones, specified by the dispersion 
relation k'^ = 0, with the geometric null cones of the background spacetime. Moreover, 
light propagation is non-dispersive, i.e. light of all frcqTicncies (or equivalently, photons of 
all momenta), travel at the same unique speed c, the fundamental constant determining the 
local Lorentz invariance of curved spacetime. Of course, this is why the distinction between 
'timelike' and 'spacelike' directions and more generally the analysis of the causal properties 
of curved spacetime is traditionally expressed in the language of light signals. However, 
as we shall now see, this whole picture has to be radically revised when we consider the 
quantum nature of light and its interaction with gravity. 

3. Vacuum Polarisation 

The quantum theory of photons interacting with electrons is quantum electrodynamics, so 
in this section we take a first look at the effects of QED interactions on photon propagation 
in curved spacetime, i.e. in a classical background gravitational field. 

The simplest Feynman diagram contributing to pho- 
ton propagation is the so-called 'vacuum polarisation' 
process shown in Fig. 1. In picturesque terms, this 
can be thought of as the photon splitting into a vir- 
tual electron-positron pair which subsequently recom- Figure 1: The 0(a) vacuum po- 
bine. The effect is, in a certain sense, to give the pho- larisation Feynman diagram con- 
ton an effective size characterised by the Compton wave- tributing to the photon propagator 

1 ,1 X -, / r J.1 1 i /I • -J. \ (wavy line). The solid lines rcpre- 

length Ac = l/m of the electron (where m is its mass). ^ ^ 

^ . , „ , sent the electron (positron) propa- 

It is therefore possible to think of the photon propagat- ^ . , 

_ r- r- r- o gator m curved spacetime. 

ing as if it were a quantum cloud of size 0(Ac) rather 

than a point particle. It is then plausible that if the photon passes through an anisotropic 
curved spacetime whose typical curvature scale L is comparable to Ac, then its motion 
would be affected, possibly in a polarisation-dependent way. 
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A rigorous analysis in QED confirms this picture. The light-cone is indeed altered 
and the photon acquires a polarisation-dependent correction to its velocity of 0(aA^/L^). 
What is less predictable is that in some cases these quantum corrections apparently produce 
superluminal velocities! 

Technically, the consequences of vacuum polarisation are summarised in the form of 
an effective action. At one-loop order, the QED effective action is given by 

r = rM + lndetS'(x,a;') (3.1) 

where 

Tm = -\J dx^ F^.F'^'' (3.2) 

is the free Maxwell action (which gives rise to the standard Maxwell equation (2.1)) and 
S{x, x') is the Green function of the Dirac operator (i.e. the electron propagator) in the 
background gravitational field, i.e. 

{ip - m)S{x, x') = -^^^{^^ x') (3-3) 

In fact it is more convenient to work with the differential operator corresponding to the 
scalar Green function G{x,x') defined by 

S{x, x') = {ip + m)G{x, x') (3.4) 

so that 

(^D^ + iea'^'Ff,^ -\^ + = " ^'^ ^^'^^ 

Then we evaluate F from the heat kernel, or proper time, representation 



r = Tm 

where 



1 f°° fJs -2 

/ ^e-*-^Trg(x,x';s) (3.6) 



Vg{x, x'; s) = i-^G{x, x'; s) (3.7) 

with Q{x,x';0) = 5{x,x'). Here, V is the differential operator in Eq.(3.5) at m = 0. 

A complete evaluation of this effective action, even at 0{a), is impossible with current 
techniques. We are forced to make two crucial approximations. The first is a weak-field 

approximation for gravity - this means we keep only terms in the final effective action 
of first order in the curvature tensors R, R^^, and R^^Xp (the Ricci scalar, Ricci tensor 
and Riemann tensor respectively). This implies our results for photon propagation are 
valid only to lowest order in the parameter A^/L^. The second, which we shall try to 
overcome in section 6, is a low-frequency approximation - this means that we neglect 
terms induced in the effective action which involve higher orders in derivatives of the fields. 
The resulting leading-order effective action was first calculated by Drummond and Hathrell 
in their pioneering paper on superluminal light [1] . Fortunately, this already encodes much 
of the novel physics of photon propagation in curved spacetime. 
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The Drummond-Hathrell action is 



Tdh = ^M + ^J dx^ (^aRF^.F^^- + hR^.^Ff^^F'^ x + cR^.^xpF^''' F^p 

+dD^F^'^D^F''x ) (3.8) 



where a,b,c,d are perturbative coefficients of 0{a), viz. 

1 a , 13 a la , la /„ 

a = b= c = d= (3.9) 

144 TT 360 TT 360 TT 30 vr ^ ' 

4. Faster than Light? 

In this section, we show how photon propagation is modified by these new interactions 
between photons and the gravitational field. As we shall see, this brings some surprises, 
notably the possibility of superluminal velocities. 

4.1 Light Cones and Birefringence 

The first step is to apply the geometric optics method described in section 2 to the equation 
of motion derived from the quantum effective action (3.8), viz: 



D,,F'"' - ^ 



2hR^xDi'F^'' + AcR/xpD^'F^P 



= (4.1) 



Here, we have neglected derivatives of the curvature tensor, which would be suppressed by 
powers of 0(A/L), where A is the photon wavelength and L is a typical curvature scale, 
and we have omitted the new contributions involving D^F^^'^ : since this term is already 
0(a) using the equations of motion, these contributions only affect the light cone condition 
at O(a^) and must be dropped for consistency. 

Implementing the geometric optics ansatz, we quickly find the new light cone condition: 

- ^R^xk^k^ + ^R.^xpk^k^a^aP = (4.2) 

Since this is still homogeneous and quadratic in k^^, we can write it as 

g^^kf.k, = (4.3) 

defining Q^'^ as the appropriate function of the curvature and polarisation. This immedi- 
ately means that, at least in this approximation, there is no dispersion.^ 

Now notice that in the discussion of the free Maxwell theory, we did not need to 
distinguish between the photon momentum p^, i.e. the tangent vector to the light rays, 
and the wave vector k^ since they were simply related by raising the index using the 
spacetime metric, = g^^ky. In the modified theory, however, there is an important 
distinction. The wave vector, defined as the gradient of the phase, is a covariant vector or 



^Notice also that this is quite different from the dispersion relation = —rn? for a 'conventional' 
tachyon. In this case (see section 5), the superluminal tachyon velocity is energy dependent. 
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1-form, whereas the photon momentum/tangent vector to the rays is a true contravariant 
vector. The relation is non-trivial. In fact, given k^, we should define the corresponding 
'momentum' as 

= g^^^K (4.4) 

and the light rays as curves where = p^^. This definition of momentum satisfies 

G^^p'^p" = g^'k^K = (4.5) 

where G = g~^ defines a new effective metric which determines the light cones mapped 
out by the geometric optics light rays. (Indices are always raised or lowered using the 
true metric g^n,.) The ray velocity Vray corresponding to the momentum p'*, which is the 
velocity with which the equal-phase surfaces advance, is given by (defining components in 
an orthonormal frame) 

along the ray. This is in general different from the phase velocity 

^0 

-Ph = ^ (4.7) 

although this discrepancy only arises when there is a difference between the direction of 
propagation along the rays and the wave 3-vector. Otherwise, it follows directly from 4.5 
that Uray and f ph are identical. 

This shows that, at least at this level of approximation, photon propagation for QED 
in curved spacetime can be characterised as a bimetric theory - the physical light cones 
are determined by the effective metric G^i, and are distinct from the geometric null cones 
which are fixed by the spacetime metric 

It is instructive to rewrite the new light cone condition in terms of the Wcyl tensor 
(the trace-free part of the Riemann tensor) and the energy-momentum tensor, using the 
Einstein field equation 

Ri^u - ^Rgt^v = SttT^^ (4.8) 



This gives [7]: 



k^ - ^{2b + Ac)T^xk^k^ + ^C^.^xph^k^a'^aP = (4.9) 



which splits the light-cone corrections into two pieces, the first representing the effect 
of matter and the second the gravitational field. The matter contribution turns out to 
be relatively universal [7, 8, 9, 10, 11]; the same formula accommodates the effects on 
photon propagation due to background magnetic fields, Casimir cavities, finite temperature 
environments, etc. It involves a projection of the energy-momentum tensor whose sign is 
fixed by the weak-energy condition, and in all non-gravitational situations this correction 
leads to a timelike p'^, i.e. a reduction in the speed of light. The second contribution, 
however, is unique to gravity. 
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It is immediately clear that the contribution of the Weyl tensor to the light-cone shift 
depends on the photon polarisation a^. This interaction therefore produces a polarisation- 
dependent shift in the velocity of light - gravitational birefringence. Moreover, it has a 
special property which is not immediately evident but can be proved simply using symmetry 
properties of the Weyl tensor (see [7]). The two physical transverse polarisations give a 
contribution of the same magnitude but of opposite sign. This has a striking consequence: 
if we consider Ricci-flat spacetimes, so that the Weyl interaction is the only contribution 
to the light-cone shift, it follows that if one polarisation produces a timelike shift in p'^, the 
other must necessarily produce a spacelike shift. That is, the light cone (4.9) necessarily 
predicts superluminal velocities! 

4.2 Cosmological and Black Hole Spacetimes 

To understand these new phenomena in more detail, we now look at two specific ex- 
amples. First, we consider the Friedmann-Robertson- Walker (FRW) spacetime which de- 
scribes standard big-bang cosmology - this is Weyl-flat so only the energy-momentum term 
in Eq.(4.9) contributes to the velocity shift. Second, we consider a Ricci fiat case, where 
only the Weyl term contributes, viz. the Schwarzschild spacetime describing a non-rotating 
black hole. 

The FRW metric is 

ds'^ = dt^ - R^{t)[Y^^ + r^{d9^ + sin^ ^#^)) (4.10) 

where R{t) is the scale factor and k = 0, ±1 determines whether the 3-space is flat, open 
or closed. The energy-momentum tensor is 

T^, = (p + P)n^n, - Pg,,, (4.11) 

with specifying the time direction in a comoving orthonormal frame, p is the energy 
density and P is the pressure, which in a radiation-dominated era are related by p — 3P = 0. 
Since the FRW metric is Weyl flat, the light cone (4.9) is simply 

k^ = C T^^^k^k" (4.12) 

where ( = ^{2b + 4c) . The corresponding phase velocity 

«ph = l + ^C(p + ^) (4.13) 

is independent of polarisation and is found to be superluminal [1]. 

At first sight, this may look surprising given that k"^ > 0, its sign fixed by the weak 
energy condition T^^k^^k'^ > 0. However, if instead we consider the momentum along the 
rays, p'^ = Q^^ky, we find 

/ = = -C(P + P){jP? (4.14) 

and 

^ray = ^ = l + k(P + ^') (4.15) 
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The effective metric G = Q ^ is (in the orthonormal frame) 



G = 



fi + Cp ° \ 

-(i-C^') 

-(1-C-P) 

V -{i-CP)J 



(4.16) 



In this case, therefore, we find equal and superluminal velocities Vp^^ = Vray and < is 
manifestly spacelike as required. 

For a second example [1, 12, 13], consider the Ricci-fiat Schwarzschild spacetime, with 
metric 

= (l- '^yt' - (l - ^) '^dr-" - r\de^ + sin^ ^d^^) (4 

In the classical theory, with = 0, there is a special solution of the null geodesic equations 
describing a light ray in a circular orbit with specified radius r = 3M. Using the new light 
cone (4.9) and choosing A;^ and to represent photons in a circular orbit with transverse 
polarisations, we find 

where is the time component of the wave vector in an orthonormal frame. The phase 
velocity is therefore 

k^ AcSM , , 

^'ph = 73 = 1 ± (4.19) 

and evaluating at r = 3M gives the phase velocity of the new circular orbit, up to 0{a). 
Crucially, we see that one of the polarisations has a superluminal velocity. 

The radius of the circular photon orbit is changed to r = 3AI + 0{a), and similarly the 
location of the effective ergosphere for the rotating black hole (Kerr metric) is shifted by 
polarisation-dependent 0{a) corrections due to the modified speed of light [13]. However, 
the event horizon itself has a special status [7, 14]. It can be shown that for a normal- 
directed null vector k^, both terms in the modified light cone (4.9) vanish [15]. It seems that 
while in general the quantum-corrected light cones differ from the geometric null cones, the 
geometric event horizon remains a true horizon for the propagation of physical photons. 

4.3 Observability? 

A similar calculation in the Schwarzschild metric shows that the angle of deflection of light 
around a star (one of the classic tests of general relativity) is modified. If the classical 
deflection angle is A^, then the quantum shift is given by [1] 

(5(A</.) = ±^i^A0 (4.20) 

where R is the distance of closest approach. This gravitational birefringence effect would 
also mean that there is in principle a polarisation dependence in gravitational lensing. Is 
this observable? 

The problem is that the magnitude of all these quantum gravitational optics phenom- 
ena is incredibly small for astrophysical scales. Recall that the order of magnitude of the 
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shift in the speed of hght is 0{a}^/ L?) for a typical curvature scale L. It is therefore 
suppressed by the square of the ratio of a quantum scale (Ac) to an astrophysical curva- 
ture scale L. This is tiny - for a solar mass black hole the correction (4.19) to the speed 
of light is 0{ampi/{m?M'^)) ~ O(10~^^), while the correction (4.20) to the bending of 
light for solar parameters is only of O(10~^^). Of course this is not surprising - it follows 
immediately from the discussion of the physics of vacuum polarisation in section 3 - and 
is typical of all phenomena in the theory of quantum fields in curved spacetimc, such as 
the famous example of Hawking radiation from black holes. The effect only becomes large 
for quantum-scale black holes, which may have arisen in the very early universe. Like 
Hawking radiation, therefore, the main interest in this phenomenon of quantum-induced 
superluminal propagation is probably conceptual: the existence of thermal Hawking radia- 
tion forces us to reconsider the consistency of quantum mechanics with gravity, while this 
superluminal effect challenges our understanding of time and causality. 

Similarly, in the case of the FRW spacetime, the change in the speed of light only 
becomes appreciable when the curvature becomes of order the quantum scale, at which 
point our weak-field approximation breaks down and we can only guess at the possible 
effects. Nevertheless, in the radiation dominated era, where p{t) = ^t~^, we have 

^ph = 1 + t-' (4.21) 

which, as already observed in rcf.[l], increases towards the early universe. Although this 
expression is only reliable in the perturbative weak-field regime, it is nonetheless intriguing 
that QED predicts a rise in the speed of light in the early universe. It is interesting 
to speculate whether this superluminal effect persists for high curvatures near the initial 
singularity and whether it could play a role in resolving the horizon problem in cosmology. 

Beyond these potential cosmological or astrophysical effects, there is a potential ob- 
struction in principle to the observation of these superluminal phenomena. The argument 
runs as follows [1] . If we assume that a typical time over which propagation can be followed 
is characterised by the curvature scale L, then the length difference between paths corre- 
sponding to different polarisations is aX^/L. Very loosely, let us say that observability 
requires this to be 0(A), where A is the wavelength of the light. The condition for observ- 
ability is therefore a\l/{\L) > 1. But the derivation required not only the weak-field 
assumption L ^ Xc but also assumed (because of the truncated derivative expansion in 
the action) that the photon frequency is small, i.e. A S> Ac. Now in practice, spectroscopic 
techniques allow very much more sensitive detection so the 0(1) on the right hand side of 
the observability criterion could be many orders of magnitude less (indeed factors of 10^*^ 
are achieved in the interferometry used in gravitational wave detectors). Nevertheless, this 
does mean that to ensure that the superluminal phenomenon is genuinely observable as a 
matter of principle and that there is no absolute obstruction to its measurement, we need to 
extend the derivation of the light cone condition beyond the low-frequency approximation. 
This is addressed in section 6, where we study the whole question of dispersion. 
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5. Causality 



The ability to send signals 'faster than light' is inextricably linked in most people's minds 
with the possibility of time travel. However, this identification is founded mainly on intu- 
ition arising from special relativity. As we shall see, the correspondence between superlu- 
minal propagation and causality in general relativity is rather more subtle. 

5.1 Super luminal Propagation in Special and General Relativity 

We therefore begin by considering superluminal propagation in special relativity. The first 
important observation is that given a superluminal signal we can always find a reference 
frame in which it is travelling backwards in time. This is illustrated in Fig. 2. 




Figure 2: A superluminal (w > 1) signal OA which is forwards in time in frame S is backwards 
in time in a frame S' moving relative to S with speed u > ^. However, the return path with the 
same speed in S arrives at B in the future light cone of O, independent of the frame. 

Suppose we send a signal from O to A at speed v > 1 (in c = 1 units) in frame S with 
coordinates {t,x). In a frame S' moving with respect to S with velocity u > i, the signal 
travels backwards in t' time, as follows immediately from the Lorentz transformation. To 
see this in detail, recall that from the Lorentz transformations we have t'j^ = 7(ii)t^(l — ) 
and x'^ = 7(ti)xA(l — ^) For the situation realised in Fig. 2, we require both > and 
< 0, that is ^ < u < V, which admits a solution only if v > 1. 

The important point for our considerations is that this by itself does not necessarily 
imply a violation of causality. For this, we require that the signal can be returned from A 
to a point in the past light cone of O. However, if we return the signal from A to B with the 
same speed in frame 5, then of course it arrives at B in the future cone of O. The situation 
is physically equivalent in the Lorentz boosted frame S' - the return signal travels forward 
in t' time and arrives at B in the future cone of O. This, unlike the assignment of spacetime 
coordinates, is a frame-independent statement. 



- 11 - 



The problem with causality arises from the scenario 
illustrated in Fig. 3. Clearly, if a backwards- in-time sig- 
nal OC is possible in frame S, then a return signal sent 
with the same speed will arrive at D in the past light 
cone of O creating a closed time loop OCDO. The cru- 
cial point is that local Lorentz invariance of the laws of 
motion implies that if a superluminal signal such as OA 
is possible, then so is one of type OC ^, since it is just 
given by an appropriate Lorentz boost. The existence 
of global inertial frames then guarantees the existence 
of the return signal CD (in contrast to the situation in 
Fig. 2 viewed in the S' frame). 



The moral is that both conditions must be met in or- 
der to guarantee the occurrence of unacceptable closed 
time loops - the existence of a superluminal signal and 
global Lorentz invariance. Of course, since global Lorentz 
invariance (the existence of global inertial frames) is the 
essential part of the structure of special relativity, we 
recover the conventional wisdom that in this theory, su- 
perluminal propagation is indeed in conflict with causality. 



Figure 3: A superluminal {v > 1) 
signal OC which is backwards in 
time in frame S is returned at the 
same speed to point D in the past 
light cone of O, creating a closed 
time loop. 



^For example, suppose we have a conventional tachyon with a spacelike momentum — —m? < 0. 
Setting — dx'^/ds so that its velocity is vtach = d\x\/dt = IpI/p", we find vtach ~ yl+^r where we 
have used the notation E = and as usual set c = 1. The velocity therefore depends on the energy 
(note that the dispersion relation p^ +vn? = 0, while Lorentz invariant, is not homogeneous in p), so by an 
appropriate Lorentz transformation we can change the superluminal tachyon velocity at will. The general 
result is that if the dynamical equations are invariant under local Lorentz transformations and superluminal 
motion of type OA (Fig. 2) is possible, then so is 'backwards-in-time' motion of type OC (Fig. 3). 
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So much for special relativity. The situation is, however, crucially different in general 
relativity. This is formulated on the basis of the weak equivalence principle, which we un- 
derstand here as the statement that a local inertial frame exists at each point in spacetime. 
This implies that spacetime in general relativity is a Riemannian manifold. However, local 
Lorentz invariance alone is not sufficient to establish the link between superluminal propa- 
gation and causality violation. This is usually established by adding a second, dynamical, 
assumption. The strong equivalence principle (SEP) states that the laws of physics should 
be identical in the local frames at different points in spacetime, and that they should reduce 
to their special relativistic forms at the origin of each local frame. It is the SEP which takes 
over the role of the existence of global inertial frames in special relativity in establishing 
the incompatibility of superluminal propagation and causality. 

However, unlike the weak equivalence principle, which underpins the essential struc- 
ture of general relativity, the SEP is merely a simplifying assumption about the dynamics 
of matter coupled to gravitational fields. Mathematically, it requires that matter or elec- 
tromagnetism is minimally coupled to gravity, i.e. with interactions depending only on 
the connections but not the local curvature. This ensures that at the origin of a local 
frame, where the connections may be Lorentz transformed locally to zero, the dynamical 
equations recover their special relativistic form. In particular, the SEP is violated by in- 
teractions which explicitly involve the curvature, such as those occurring in the quantum 
Drummond-Hathrell action and the consequent modified light cones. 

We discuss below the question of whether this specific realisation of superluminal 
propagation is in conflict with causality, using the concept of stable causality described in 
ref.[16]. Notice though that by violating the SEP, we have evaded the necessary association 
of superluminal motion with causality violation that held in special relativity. Referring 
back to the figures, what is established is the existence of a signal of type OA, which as 
we saw, does not by itself imply problems with causality even though frames such as S' 
exist locally with respect to which motion is backwards in time. However, since the SEP 
is broken, even if a local frame exists in which the signal looks like OC, it does not follow 
that a return path CD is allowed. The signal propagation is fixed, determined locally by 
the spacetime curvature. 

5.2 Stable Causality 

One special case where causality is realised in a particularly simple way is in globally 
hyperbolic spacetimes, where the manifold admits a foliation into a set of spacclike Cauchy 
surfaces with fibres given by timelike geodesies. It is not hard to imagine that the same 
structure could be preserved using the effective metric G^i^ to define 'spacelike' or 'timelike', 
especially if Gfj,,^ is only perturbatively different from the actual spacetime metric g^i,. But 
this would be a global question and the preservation of global hyperbolicity is not a priori 
guaranteed. 

The clearest criterion for causality in general involves the concept of stable causal- 
ity discussed, for example, in the monograph of Hawking and Ellis [16] (see also [17]). 
Proposition 6.4.9 states the required definition and theorem: 
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• A spacetime manifold {Ai,g^i,) is stably causal if the metric 5^,^ has an open neigh- 
bourhood such that Ai has no closed timelike or null curves with respect to any metric 
belonging to that neighbourhood. 

• Stable causality holds everywhere on M if and only if there is a globally defined function 
/ whose gradient D^/ is everywhere non-zero and timelike with respect to g^,^. 

According to this theorem, the absence of causality violation in the form of closed 
timelike or lightlike curves is assured if we can find a globally defined function / whose 
gradient is timelike with respect to the effective metric G^i, for light propagation. / then 
acts as a global time coordinate. 

We can immediately give one example where this condition does hold. This is the FRW 
spacetime discussed in section 4.2. In this case, the effective metric (in the unperturbed 
orthonormal frame) is given by (4.16). We simply use the cosmological time coordinate t as 
the globally defined function /. We need only then check that D^t defines a timelike vector 
with respect to the effective metric G^y. This is true provided Goo > 0, which is certainly 
satisfied by Eq.(4.16). So at least in this case, superluminal propagation is compatible with 
causality. 



6. Gravity's Rainbow 

So far, our discussion of photon propagation has been based on the low-frequency approx- 
imation arising from the effective action (3.8). However, we know that this cannot be the 
whole story. In this section, we investigate dispersion - how the speed of light depends on 
its frequency - and explain carefully why the high-frequency limit is critical for causality. 
We begin with a review of some basic properties of light propagation in classical optics. 

6.1 The 'Speeds of Light' 

An illuminating discussion of wave propagation in a simple dispersive medium is given in 
the classic work by Brillouin [18]. This considers propagation of a sharp-fronted pulse of 
waves in a medium with a single absorption band, with refractive index n{ui): 



a 



2 



n^uj) = 1 - 5 — -— (6.1) 

where a, p are constants and is the characteristic frequency of the medium. Five^ distinct 
velocities are identified: the phase velocity Vp^i = J^ = ^Tlfb)' 9™'^P velocity = 
signal velocity -z^ig, energy-transfer velocity v^n and wavefront velocity v^f, with precise 
definitions related to the behaviour of contours and saddle points in the relevant Fourier 
integrals in the complex tj-plane. Their frequency dependence is illustrated in Fig. 4. 

As the pulse propagates, the first disturbances to arrive are very small amplitude waves, 
'frontrunners', which define the wavefront velocity w^f- These are followed continuously 
by waves with amplitudes comparable to the initial pulse; the arrival of this part of the 



^In fact, if we take into account the distinction discussed in section 4 between the phase velocity Vph and 
the ray velocity Uray , and include the fundamental speed of light constant c from the Lorentz transformations, 
we arrive at seven distinct definitions of 'speed of light'. 
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(velocity) 




Figure 4: Sketch of the behaviour of the phase, group and signal velocities with frequency in 
the model described by the refractive index Eq.(3.1). The energy-transfer velocity (not shown) is 
always less than c and becomes small near ojq. The wavefront speed is identically equal to c. 



complete waveform is identified in ref.[18] as the signal velocity Vsig- As can be seen from 
Fig. 4, it essentially coincides with the more familiar group velocity for frequencies far 
from ujQ, but gives a much more intuitively reasonable sense of the propagation of a signal 
than the group velocity, whose behaviour in the vicinity of an absorption band is relatively 
eccentric. 



n(w) 



In passing, notice that it is the 
group velocity which is measured in 
quantum optics experiments which find 
light speeds of essentially zero [19] or 
many times c [20]. For example, in 
the experiments of Vestergaard Hau 
and colleagues at Harvard, the group 
velocity of a light pulse is reduced al- 
most to zero by shining tuned lasers 
on a cloud of ultra-cold sodium atoms. 
The set-up is designed to produce an 
effective refractive index of the cloud 
of the form shown in Fig. 5. A very 
clear explanation of how this is achieved 

by manipulating the states of the sodium atoms in the gas cloud and exploiting quantum 
superposition is given in ref.[19]. Now, some simple algebra shows that the group velocity 




Figure 5: The effective refractive index of the cloud 
of ultra-cold sodium atoms used in the 'slow light' ex- 
periments. 



can be expressed in terms of the slope of the refractive index: 



n + u! 



dn 
duj , 



The 
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group velocity therefore becomes small in a region where the slope of the refractive index 
is large. In the 'slow-light' experiments, the central section of the curve n{Lo) in Fig. 5 
is made as steep as possible and the frequency of the probe laser tuned to this frequency 
range, forcing the group velocity towards zero. 

Returning to Fig. 4, it is clear that the phase velocity itself also does not represent a 
'speed of light' relevant for considerations of signal propagation or causality. The appropri- 
ate velocity to define light cones and causality is in fact the wavefront velocity t;„f . (Notice 
that in Fig. 4, u^f is a constant, equal to c, independent of the frequency or details of the 
absorption band.) This is determined by the boundary between the regions of zero and 
non-zero disturbance (more generally, a discontinuity in the first or higher derivative of 
the field) as the pulse propagates. Mathematically, this definition of wavefront is identified 
with the characteristics of the partial differential equation governing the wave propagation 
[21]. Our problem is therefore to determine the velocity associated with the characteristics 
of the wave operator derived from the modified Maxwell equations of motion appropriate 
to the new effective action. 

Notice that a very complete and rigorous discussion of the wave equation in curved 
spacetime has already been given in the monograph by Priedlander[22], in which it is proved 
(Theorem 3.2.1) that the characteristics are simply the null hypersurfaces of the spacetime 
manifold, in other words that the wavefront always propagates with the fundamental speed 
c. However, this discussion assumes the standard form of the (gauge-fixed) Maxwell wave 
equation and does not cover the modified wave equation (4.1), precisely because of the extra 
curvature couplings which lead to the effective metric G^i, and superluminal propagation. 

6.2 Characteristics and Wavefronts 

Instead, the key result which allows a derivation of the wavefront velocity is derived by 
Leontovich [23]. In this paper^, an elegant proof is presented for a very general set of PDEs 
that the wavefront velocity associated with the characteristics is identical to the a; — > oo 
limit of the phase velocity, i.e. 

Vwf = lim -^^ = lim Vph(a;) (6.2) 

OJ^OO |fc| OJ— >(X) 

The proof is rather formal, but is of sufficient generality to apply to our discussion of 
photon propagation using the modified effective action of section 3. We reproduce the 
essential details below. 

The first step is to recognise that any second order PDE can be written as a system of 
first order PDEs by considering the first derivatives of the field as independent variables. 
Thus, if for simplicity we consider a general second order wave equation for a field u(t, x) 
in one space dimension, the system of PDEs we need to solve is 

(^ij-Qf + bij-g^ + Cij(l)j = (6.3) 

*I am very grateful to A. Dolgov, V. Khoze and I. Khriplovich for their help in obtaining and interpreting 
ref.[23]. 
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where (A» = {^,t,i|}. 

Making the 'geometric optics' ansatz 



(f)i = (pi exp i{wt — kx) 



(6.4) 



where the frequency-dependent phase velocity is Vph{k) = uj{k)/k, and substituting into 
Eq.(6.3) we find 

{^uaij — ikbij + Cij^ ipj = (6-5) 

The condition for a solution, 



det 



Hj'Vphik^ bij . Cij 



k 



= 



(6.6) 



then determines the phase velocity. 

On the other hand, we need to find the characteristics of Eq.(6.3), i.e. curves C on which 
Cauchy's theorem breaks down and the evolution is not uniquely determined by the initial 
data on C. The derivatives of the field may be discontinuous across the characteristics 
and these curves are associated with the wavefronts for the propagation of a sharp-fronted 
pulse. The corresponding light rays are the 'bicharacteristics'. (See, for example, ref.[21] 
chapters 5.1, 6.1 for further discussion.) 

We therefore consider a characteristic curve C in the {t, x) plane separating regions 
where = (ahead of the wavefront) from (f)i ^ (behind the wavefront). At a fixed 
point {to,xo) on C, the absolute derivative of (pi along the curve, parametrised as x{t), is 
just 



dt 



dt 



+ 



dx 



dx 
dt 



(6.7) 



where dx/dt = ■u^f gives the wavefront velocity. Using this to eliminate ^ from the PDE 
Eq.(6.3) at (to,a^o)) we find 



dx 



dx 



+ ai 



dt 



+ Cij<t>j 



(0) _ 







(6.8) 



Now since C is a wavefront, on one side of which 0j vanishes identically, the second two 
terms above must be zero. The condition for the remaining equation to have a solution is 
simply 



det 



O'ijVyfi bij 







(6.9) 



which determines the wavefront velocity v^f- The proof is now evident. Comparing 
Eqs.(6.6) and (6.9), we clearly identify 



^^wf = Vph{k -> oo) 



(6.10) 



The wavefront velocity in a gravitational background is therefore not given a priori by 
c. Taking vacuum polarisation into account, there is no simple non-dispersive medium cor- 
responding to the vacuum of classical Maxwell theory in which the phase velocity represents 
a true speed of propagation; for QED in curved spacetime, even the vacuum is dispersive. 
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In order to discuss causality, we therefore have to extend the original Drummond-Hathrell 
results for Vphi'^ ~ 0) to the high frequency limit Vphif^ oo), as already emphasised in 
their original work. 

A further subtle question arises if we write the standard dispersion relation for the 
refractive index n{Lo) in the limit a; — ^ oo: 



n(oo) = n(0) - - / —Qn{uj) (6.11) 



For a conventional dispersive medium, Qn{io) > 0, which implies that n(oo) < n(0), 
or equivalently Vph(c!o) > Uph(O). Evidently this is satisfied by Fig. 4. The key question 
though is whether the usual assumption of positivity of 9n(ix') holds in the present situation 
of the QED vacuum in a gravitational field. If so, then (as already noted in ref.[l]) the 
superluminal Drummond-Hathrell results for fph(O) would actually be lower bounds on the 
all-important wavefront velocity Vph{oo). However, it is not clear that positivity of Qn{io) 
holds in the gravitational context. Indeed it has been explicitly criticised by Dolgov and 
Khriplovich in refs.[24, 25], who point out that since gravity is an inhomogeneous medium in 
which beam focusing as well as diverging can happen, a growth in amplitude corresponding 
to 9n(w) < is possible. The possibility of Vp\i{oo) < fph(O), and in particular VpY^{oo) = c, 
therefore does not seem to be convincingly ruled out by the dispersion relation Eq.(6.11). 

6.3 Light in Magnetic Fields 

Before confronting the gravitational case, we can build up some intuition on dispersion 
by looking at a closely related situation - the propagation of light through a background 
magnetic field. In this case, the analogTic of the low-frequency, weak-field Drummond- 
Hathrell action for gravity is the Euler-Heisenberg action: 



(6.12) 



where p = and q = —^a^. For a constant magnetic field, a familiar geometric optics 
analysis produces the following results for the phase velocity of photons moving transverse 
to the background field: 



(2p + 4g)|,,px Ti^^k'^k" = (6.13) 



where T^^i^ = F^^F\y — ^g^^FxpF-^^ is the electromagnetic energy-momentum tensor and 
the suffices ||,_L indicate the two polarisations. This should be compared with Eq.(4.9) 
for the gravitational case. Notice that for electromagnetism, birefringence appears already 
in the energy-momentum tensor term, essentially because the lowest order terms in the 
Euler-Heisenberg action contributing to (6.13) are of 4th order in the background fields, 
i.e. O(-F^), compared to the 3rd order terms of 0{RFF) in the Drummond-Hathrell action. 
Substituting for p,q we find the well-known results for the low-frequency limit of the phase 
velocity: 

a /eS\2 rl4 8 i 
»pH(-^0)~l-^(y f-.l,l (6.14) 



45||' 45 ± 
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As expected, < 1 for both polarisations. 

The refractive index for photons of arbitrary frequency has been calculated explicitly 
by Tsai and Erber [33, 34]. They derive an effective action 



r = - dx 



(6.15) 



where M^jy is a differential operator acting on the electromagnetic field A'^ . Writing the 
corresponding equation of motion and making the geometric optics ansatz as usual, we find 
the light cone condition 

- M^^{k)a^'a!' = (6.16) 

Denoting M^i^a'^a'' by Afy , M± for the two polarisations, the complete expression for 
the birefringent refractive index is: 

^ph ||,±(w) = 1 + 2^ ^lU 

= ' + IJ^'T'' ' e-(^-^-^-(-)) (6.17) 

where z = and $7 = ^ — . The functions N and P are given by: 

and 

^11 =-^(i-^' + ^) + SI = i(i-«^)(i-K) + o(^') 

^± = + ^"tsi^T^" + ^(^°^4Vr^^ = 1(1 -U2)(l + 1^2) + 0(^2) (6.19) 

In the weak field, low frequency limit, we can disregard the function P and consider 
only the lowest term in the expansion of N in powers of z. This reproduces the results for 
Vp\i{uj 0) shown in Eq.(6.14). The weak field, high frequency limit is analysed in ref.[34]. 
It is shown that 



Vp^wA^^^) ~ l + 4^(— J [c||,cJ0-3 (6.20) 

where the numerical constants are [c||,cx] = ^J^^/^TV{^)'^{^{^))~^ [3||,2x]. 

The complete frequency dependence of v^-^{u), or equivalently n(a;), is therefore known 
and shows exactly the features found in the simple absorption model described above. 
In particular, the phase velocity v^\y{uj) begins less than 1 at low frequencies, showing 
birefringence but conventional subluminal behaviour. In the high frequency limit, however, 

_4 

the phase velocity approaches c from the superluminal side with a a; 3 behaviour. 

The origin of this non-analytic high-frequency behaviour is of particular interest. It 
arises from the phase factor exp [— is^il^P(n, z)] in the effective action. The absorption 
resonance region in the graph of the refractive index n{uj) occurs for frequencies uj such 
that ~ 1. For high frequencies, even at weak fields, we need O ^ oo. Clearly, these 
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aspects of the behaviour of n{uj) arc invisible to a simple expansion of the effective action 
in low powers of the field strength B, even keeping terms of all orders in derivatives. 
The conclusion is that a complete analysis of resonant and high frequency propagation 
requires the evaluation of the non-perturbative (in a field strength expansion) phase factor 
in Eq.(6.17). As we now see, this poses severe problems for the analysis of high frequency 
propagation in the case of a background gravitational field. 

6.4 Dispersion and Gravity 

Setting aside this concern for the moment, we now return to gravity and begin our study 
of dispersion by constructing the generalisation of the Drummond-Hathrell effective action 
containing all orders in derivatives, while retaining only terms of 0{RFF) in the curvature 
and field strength. This action was recently derived in ref.[4]. The result is: 

r = Jdx^ -iF^.F^^'^ + ^{d^f^>^ g'o D^F'^^ 

+ gI RF^^Fi^-' + R^^Ff'^F-'x + R^,^.xpFf'^' F^p') 
+ ^(gI RD^Ft^>^D,F'^x + gI R^.xpDaF'^PD^Ff'- 
+ gI R^,DxF>^i^DpFP- + R^^D'^F^^D'^Fxp 
+ G^r R^^D^D^F^PFxp + gI R^.^D^D^FxpFP'' ) (6.21) 

This was found by adapting a background field action valid to third order in generalised 
curvatures due to Barvinsky, Guscv, Zhytnikov and Vilkovisky [26] (see also ref.[27]) and 
involves re-expressing their more general result in manifestly local form by an appropriate 
choice of basis operators. 

In this formula, the G„ (n > 1) are form factor functions of three operators: 

^ £)2^ Jj2^^ 

Gn = Gn[ — — — o- ) (6.22) 

where the first entry {D^\)) acts on the first following term (the curvature), etc. Go is 
similarly defined as a single variable function. These form factors are found using heat 
kernel methods and are given by 'proper time' integrals of known algebraic functions. 
Their explicit expressions can be found in ref.[4]. Evidently, Eq.(6.21) reduces to the 
Drummond-Hathrell action if we neglect all the higher order derivative terms. 

The next step is to derive the equation of motion analogous to Eq.(4.1) from this 
generalised effective action and to apply geometric optics to find the corresponding light 
cone. This requires a very careful analysis of the relative orders of magnitudes of the 
various contributions to the equation of motion arising when the factors of in the form 
factors act on the terms of 0{RF). These subtleties are explained in detail in ref.[3]. The 
final result for the new effective light cone has the form 

m V m / m V m 



oFi-YFi.xk'^k^ + -^G[-^]Cp,xpk^'k\''aP = (6.23) 
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where F and G are known functions with weh-understood asymptotic properties [3]. Clearly, 
for agreement with Eq.(3.8), we have F(0) = 26 + 4c, G(0) = 8c. 

The novel feature of this new light cone condition is that F and G are functions of 
the operator k.D acting on the Ricci and Riemann tensors.^ So although the asymptotic 
behaviour of F and G as functions is known, this information is not really useful unless 
the relevant curvatures are eigenvalues of the operator. On the positive side, however, k.D 
does have a clear geometrical interpretation - it simply describes the variation along a null 
geodesic with tangent vector A;'*. 

The utility of this light cone condition therefore seems to hinge on what we know 
about the variations along null geodesies of the Ricci and Riemann (or Weyl) tensors. 
Unfortunately, we have been unable to find any results in the general relativity literature 
which are valid in a general spacetime. To try to build some intuition, we have therefore 
again looked at particular cases. The most interesting example in this case is photon 
propagation in the Bondi-Sachs metric [29, 30] which we recently studied in detail [31]. 

The Bondi-Sachs metric describes the gravitational radiation from an isolated source. 
The metric is 

= -Wdu'^ - 2e^^dudr + r'^hij{dx' - U'du){dx^ - Wdu) (6.24) 

where 

hijdx'dx^ = ^ (e^T + e'^^)de^ + 2 sinh(7 - S) sin Odedcj) + ^ (e'^^ + e'^^) sin^ 9d4>'^ (6.25) 

The metric is valid in the vicinity of future null infinity T'^. The family of hyper surfaces 
u = const are null, i.e. g'^" d^udi^u = 0. Their normal vector satisfies 

£f, = d^u => f = 0, e''D^r = (6.26) 

The curves with tangent vector are therefore null geodesies; the coordinate r is a radial 
parameter along these rays and is identified as the luminosity distance. The six independent 
functions W, (5,^,6, W characterising the metric have expansions in i in the asymptotic 
region near X+, the coefficients of which describe the various features of the gravitational 
radiation. 

In the low frequency limit, the light cone is given simply by Eq.(4.9) with T^,y = 0. 
The velocity shift is quite different for the case of outgoing and incoming photons [31]. For 
outgoing photons, fc^ = u;£^, and the light cone, written in Newman- Penrose notation^ is 

= ^ ^(^° + ^o) - (6.27) 

®Note that because these corrections are aheady of 0{a), we cam freely use the usual Maxwell relations 
k.Dk" = and k.Da" = in these terms; we need only consider the effect of the operator k.D acting on 
R^v and R^^xp. 

^Details of the Newman-Penrose formalism can be found, e.g., in refs.[7, 31]. The essential idea is to 

construct a basis of null vectors n'^, and m^, and refer the components of the Ricci and Weyl 
tensors to this basis. The independent components of the Weyl tensor then form a set of 5 complex 
scalars, denoted ^fo, . . . , 3'4. For example, = —C^ii,\p(y'm"l^m''. In the example above, we identify the 
momentum direction for outgoing photons with vnF' is then the complex linear combination oy -|-mx of 
the physical transverse polarisations. 
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while for incoming photons, = Lon^, 

= ± ^(*4 + *3) ~ 0(i) (0.28) 

Now, it is a special feature of the Bondi-Sachs spacetime that the absohite derivatives 
of each of the Weyl scalars ^'o, . . . , ^'4 along the ray direction vanishes, i.e. ^'o, . . . , ^4 
are parallel transported along the rays [30, 32]. In this case, therefore, we have: 

£-D^'o = t-D^i = Q (6.29) 

but there is no equivalent simple result for cither n - D ^'4 oi n ■ D ^q. 

Although it is just a special case, Eq.(6.29) nevertheless leads to a remarkable conclu- 
sion. The full light cone condition Eq.(6.23) applied to outgoing photons in the Bondi-Sachs 
spacetime now reduces to 

2 

± ^ G(0)(*o + *S) = (6-30) 

since I ■ -D^'o = 0. In other words, the low-frequency Drummond-Hathrell prediction of a 
superluminal phase velocity 'yph(O) is exact for all frequencies. There is no dispersion, and 
the wavefront velocity fph(oo) is indeed superluminal. 

This is potentially a very important result. It appears to show that there is at least 
one example in which the wavefront truly propagates with superluminal velocity. If so, 
quantum effects would indeed have shifted the light cone into the geometrically spacelike 
region. 

However, we now have to confront the problem raised above in the discussion of light 
propagation in background magnetic fields. If the gravitational case is similar, this would 
imply that the modified light cone can be written heuristically as 

k' + - dsAr(s,i?) e--(i+^'^'^(^'«)) = (6.31) 

where both M and V can be expanded in powers of curvature, and derivatives of curvature, 
presumably associated with factors of lo as in the last section. The frequency dependent 
factor ri would be $7 ~ -t-;^ ~ O(tt^), where 'i?' denotes some generic curvature com- 
ponent and L is the typical curvature scale. If this is true, then an expansion of the 
effective action to 0{RFF), even including higher derivatives, would not be sufficient to 
reproduce the full, non-perturbative contribution exp[— zs^O^T^]. The Drummond-Hathrell 
action would correspond to the leading order term in the expansion of Eq.(6.31) in powers 
of ^ neglecting derivatives, while our improved effective action (6.21) sums up all orders 
in derivatives while retaining the restriction to leading order in curvature. 

The omission of the non-perturbative contribution would be justified only in the limit 
of small Q, i.e. for ^ 1- Neglecting this therefore prevents us from accessing the 
genuinely high frequency limit A needed to find the asymptotic limit Uph(oo) of the 
phase velocity. Moreover, assuming Eq.(6.31) is indeed on the right lines, it also seems 
inevitable that for high frequencies (large 0,) the rapid phase variation in the exponent will 
drive the entire heat kernel integral to zero, ensuring the wavefront velocity f^f = c. 
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At present, it is not clear how to make further progress. The quantum field theoretic 
calculation required to find such non-perturbative contributions to the eflcective action and 
confirm an exp[— is^Q^P] structure in Eq.(6.31) appears difficult, although some technical 
progress in this area has been made recently in ref.[36] and work in progress [37]. One of 
the main difficulties is that since a superluminal effect requires some anisotropy in the cur- 
vature, it is not sufficient just to consider constant curvature spacctimcs. (Recall that the 
Ricci scalar term in the effective action (3.8) does not contribute to the modified light cone 
(4.2).) A possible approach to this problem, which would help to control the plethora of 
indices associated with the curvatures, might be to reformulate the heat kernel calculations 
directly in the Newman-Penrose basis. On the other hand, perhaps a less ambitious goal 
would be to try to determine just the asymptotic form of the non-perturbative contribution 
in the Q ^ oo limit. 

A final resolution of the dispersion problem for QED in curved spacetime has therefore 
still to be found. At present, the most likely scenario appears to be that high-frequency 
dispersion is driven by non-perturbative contributions to the effective action such that the 
wavefront velocity remains precisely c. It would then be interesting to see exactly how 
the phase velocity behaves as a function of lv. What we have established is that for 'sub- 
resonant' frequencies the phase velocity may, depending on the polarisation, take both the 
conventional form of Fig. 4 or the superluminal mirror image where fph(O) > 1. We have 
even seen in the Bondi-Sachs example a special case where fph('^) ~ const > 1 for relatively 
low frequencies. 

However, even if this picture is correct and the light cone is eventually driven back 
to A:^ = in the high frequency limit, the analysis described here still represents a crucial 
extension of the domain of validity of the superluminal velocity prediction of Drummond 
and Hathrell. Recall from section 4.3 that the constraint on the frequency for which the 
superluminal effect is in principle observable is ^ 3> 1. Obviously this was not satisfied 
by the original a; ~ derivation. However, our extension based on the generalised effective 
action (6.21) does satisfy this constraint. Combining with the restriction ^ 1 in which 
the neglect of the V type corrections is justified, we see that there is a frequency range 

I » A » I (6.32) 

below the 'resonance' region of Fig. 4, for which our expression (6.23) for the modified light 
cone is valid and predicts in principle observable effects. 

Since this formula allows superluminal corrections to both the phase and the signal 
velocities, we conclude that superluminal propagation has indeed been established as an 
observable phenomenon even if, as seems likely, causality turns out to respected through 
the restoration of the standard light cone A;^ = in the asymptotic high frequency limit. 

7. Speculations 

In this paper, we have reviewed some of the qualitatively new phenomena that occur when 
quantum theory is introduced into the already rich field of gravitational optics. The picture 
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that arises is of curved spacetime as an optical medium, characterised by a novel refractive 
index and displaying polarisation-dependent properties such as birefringence. 

Naturally, since these phenomena arise due to intrinsically quantum field theoretic 
processes such as vacuum polarisation, they are extremely small for the curvatures typical 
of present day astrophysics. A typical order of magnitude for the shift in the speed of light 
around a black hole, for example, is 0{am^i/ (m^M^)), where m is the electron mass setting 
the scale for vacuum polarisation and M is the black hole mass. Quantum gravitational 
optics (QGO) is therefore only likely to play an important role in situations of extreme, 
quantum-scale, curvature such as may be present in the very early universe or in the vicinity 
of primordial black holes. Its main interest, as in the analogous case of Hawking radiation, 
is likely to be primarily conceptual. 

Indeed, serious conceptual issues do arise through the apparent prediction in certain 
circumstances of a superluminal speed of light. We have explained carefully how this oc- 
curs and discussed the subtle questions of interpretation that arise. Although further work 
remains to be done to achieve a full understanding, it appears that despite an anoma- 
lous refractive index featuring a superluminal low-frequency phase (and signal) velocity, 
the critical high-frequency limit which determines the characteristics or wavefront velocity 
remains equal to c. Nonetheless, we described how superluminal velocities could be ac- 
commodated in the framework of general relativity while preserving the essential notion of 
stable causality. 

All this work was within the relatively uncontroversial theoretical formalism of quan- 
tum electrodynamics in curved spacetime. To close, we give a brief survey of other more 
speculative ideas in the current literature which overlap in some way with the ideas pre- 
sented here. 

First, we should note that everything that has been said about the QED effective 
action arises equally in the low-energy effective actions derived from string theory. SEP- 
violating interactions of 0{RFF) naturally occur there too, with the main difference that 
the electron scale m becomes the Planck mass m^;. Evidently this simply reduces the 
relative order of magnitude by another huge factor. 

One of the key insights in the theory presented here is that in QGO the usual light 
cone relation for the photon momentum becomes 

G^.p'^p'' = (7.1) 

That is, the propagation of real photons is governed by a new metric G^;^, which may 
in general depend not only on the curvature and its derivatives but also on the photon 
direction, polarisation and energy. The physical light cones are then distinct from the 
geometric null cones characterised by the spacetime metric g^i,. This realisation of QGO 
is therefore intrinsically a bimetric theory. 

Inspired by this, it is interesting to speculate on modifications to conventional gen- 
eral relativity in which a bimetric structure is imposed a priori. Phenomenologically, this 
has the advantage that the new scale characterising the propagation metric can be a free 
parameter, with the result that the type of phenomena discussed here could occur on macro- 
scopically accessible scales. As a result, there is a strong motivation to search for unusual 



-24- 



velocity or polarisation dependent phenomena in light signals from cosmological sources. 
Bimetric theories have a long history, with a particularly elegant recent construction due 
to Drummond [38], who has proposed an extension of general relativity in which matter 
couples to its own vierbein, which is rotated relative to the geometric vierbein according 
to an appropriate sigma model dynamics. By modifying gravitational dynamics on an as- 
trophysical scale, such theories have the potential to offer an alternative to dark matter 
models. Other recent work exploiting bimetric theories to resolve cosmological puzzles can 
be found in a series of papers on scalar-tensor gravity by Clayton and Moffat [39, 40]. 

These theories modify general relativity, and by incorporating a bimetric structure 
they necessarily involve a varying speed of light (VSL). VSL theories and their potential 
for providing an alternative to inflation as a resolution of cosmological problems such as 
the horizon problem have seen a rapid rise in popularity in recent years. A small selection 
of relevant papers is included here as an introduction to this literature for the interested 
reader [41, 42, 43, 44, 45, 46]. 

The QED version of QGO discussed in this paper, and some of the bimetric mod- 
els, preserve the essential principles of general relativity, especially the weak equivalence 
principle and local Lorentz invariance. We can of course speculate that local Lorcntz in- 
variancc itself may be violated. A phenomenological approach to such theories has been 
pioneered by Kostclccky, who has constructed a general extension to the standard model 
incorporating Lorentz and CPT violating interactions and investigating their experimental 
consequences [47]. Of particular interest here are the following interactions, which affect 
the propagation of light: 



Here, K and L are Lorentz-violating couplings, which could arise as VEVs in some more 
fundamental theory exhibiting spontaneous Lorentz breaking. Evidently, the first of these 
interactions is the analogue of the SEP-violating interaction considered in this paper, with 
the replacement of the covariant curvature tensor R^^Xp by the coupling K^i^xp- The 
phenomenology of light propagation, and some (though not all) of the considerations con- 
cerning causality discussed here therefore apply directly to this class of Lorentz-violating 
theory. The CPT violating Chern-Simons interaction does not of course arise in QED, 
though it can be induced by anomalies involving spacetime topology [48]. The Kostelecky 
lagrangian has been used as a phenomenological model against which to put limits on the 
magnitude of VSL, polarisation-dependent or Lorentz-violating effects in light propaga- 
tion. Since a birefringent speed of light would cause a rotation of the polarisation of light 
as it travels through space, the observation of polarisation from distant quasars and radio 
galaxies, and also recent measurements of CMB polarisation, can be used to place limits 
on the phenomenological couplings K and L. Details of this work can be found in ref . [49] . 
Similar interactions are induced also in non-commutative gauge theories as a consequence 
of the Moyal product, with similar phenomenological implications. Here, the analogue of 
the coupling K^^xp is constructed from the non-commutativity parameter 6*^^ defined from 




(7.2) 
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Of course, once we permit models with more or less ad hoc Lorentz violation, then we 
are free to speculate at will as to alternative dispersion relations for the propagation of 
light, or indeed neutrinos. Some recent popular models invoke a supposed non-realisation 
of Lorentz invariance in the effective low-energy field theory derived from quantum gravity 
[50, 51, 52], non-linear realisations of Lorentz symmetry preserving the Planck length as 
an invariant [53], so-called 'doubly special' relativity [54], etc. Evidently, there is immense 
freedom to frame such models within the constraints of experimental data though it is far 
less clear how to incorporate them into complete theories consistent with our understanding 
of unified theories of particle physics and quantum gravity. On the experimental side, 
particular interest attaches to light (or neutrinos) received from gamma-ray bursts, since 
the extremely high energies of the photons can enhance the size of the VSL effect for certain 
Lorentz-violating dispersion relations. 

This rapid tour of admittedly rather speculative models is hopefully sufficient to in- 
dicate the considerable current interest in looking for anomalies in the standard theory 
of light propagation in special and general relativity. However, as we have shown in this 
paper, it is not necessary to invoke ad hoc violations of the fundamental principles of quan- 
tum field theory, general relativity or string theory in order to discover a new richness 
of phenomena in the propagation of light. Whether its primary interest will lie in the 
conceptual issues it raises for the consistency of quantum field theory with gravity, or in 
the prediction of experimental anomalies in the speed of light in cosmology, the field of 
quantum gravitational optics is promised a bright future. 
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